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1 .  Introduction 

We  consider  the  following  two  problems. 

( 1 )  The  linear  recursion  problem 

A  linear  recursive  program  is  a  set  of  ALGOL-like  procedures 
each  of  which  contains  at  most  one  procedure  call.  All  parameters  are  passed 
by  value.  There  is  one  specified  procedure  that  is  evaluated  with  certain 
given  inputs.  The  problem  is  to  compile  a  given  linear  recursive  program  into 
an  efficient  program  without  recursion.  The  reader  may  wonder  at  this  point 
what  is  wrong  with  the  conventional  stack  implementation  of  recursion  since 
that  represents  about  as  fast  as  one  can  hope  to  go.  The  problem  is  that  it 
takes  a  great  deal  of  space.  In  this  context  efficiency  refers  to  space 
efficiency.  On  the  other  hand,  most  compiler  writers  are  aware  of  an 
implementation  that  requires  space  for  just  one  or  two  values  but  takes  a 
great  deal  of  time  -  Proportional  to  the  square  of  the  recursion  depth.  On 
comparison  with  this  algorithm  efficiency  refers  to  time  efficiency. 

For  a  treatment  of  some  problems  related  to  the  linear 
recursion  problem  see  Chandra  [1972]. 


0  (?)  The  schema  problem 

A  schema  is  a  program  in  which  the  base  functions  and  predicates 
are  left  uninterpreted.  A  schema  along  with  a  given  interpretation 
p  characterizes  a  computation.  The  following  recursive  definition  specifies 

a  schema: 

Compute  F(a)  where 

»  F (y )  -  if  p (y)  then  h(y)  else  g (y ,F ( f (y) ) ) . 

This  schema  has  been  considered  in  some  form  or  the  other  by 
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several  authors  e.g.,  Paterson  and  Hewitt  [1970]  ,  Hewitt  [1970],  Garland  and 
l.uckham  [I97I],  Strong  and  Walker  [1972].  The  base  functions  f,g  and  h, 
the  individual  constant  a,  and  the  predicate  p  are  not  interpreted.  In  this 
schema  there  is  some  implicit  storage  allocation,  i.e.,  the  value  of  y  is 
stored  while  F(f(y))  is  computed,  and  the  two  are  then  used  to  obtain 
g(y ,F( f (y) ) ) .  The  problem  is  to  translate  (or  compile)  this  recursive 
schema  into  an  efficient  flowchart  schema  that  does  its  storage  allocation 
explicitly. 

We  solve  the  linear  recursion  problem  by  first  converting  it  to 
the  schema  problem  and  then  solving  that.  In  some  sense  the  schema  problem 
looks  like  a  simplified  version  of  the  linear  recursion  problem  where  there 
is  just  one  procedure  which  consists  of  a  single  if-then-else  statement. 

However,  the  use  of  a  schema  as  an  intermediate  step  in  the  solution  has  some 
other  bonuses  stemming  from  the  fact  that  the  base  functions  and  predicates 
of  the  schema  are  uninterpreted.  Hence,  by  specifying  appropriate  inter¬ 
pretations  one  immediately  obtains  solutions  for  several  different  problems 
that  can  be  modelled  by  the  schema  (as  shown  in  Appendix  i) . 

Section  2  defines  the  schema  problem  and  the  linear  recursion  problem 
in  somewhat  greater  detail  and  shows  how  the  latter  can  be  reduced  to  the 
former.  The  reader  may  safely  omit  this  section,  though  it  may  be  desirable 
to  read  subsections  2.1  and  the  first  part  of  2.2,  which  define  the  schema 
problem  and  the  linear  recursion  problem.  Section  presents  the  main  results 
of  this  paper.  Efficient  solutions  for  the  schema  problem  are  given,  and  space- 
time  tradeoffs  are  considered.  Section  4  demonstrates  some  practical  aspects 
of  these  results.  In  Appendix  I  (Section  5)  we  mention  two  other  problems  that 
can  be  reduced  to  the  schema  problem.  Most  of  the  detailed  algorithms  are  not 
inserted  in  the  mainstream  of  the  paper  to  allow  for  ease  in  reading.  These 
are  given  in  Appendix  II  (Section  6). 
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2  .  Translation  of  the  Linear  Recursion  Problem  to  the  Schema  Problem 
1  A  Definition  of  the  Schema  Problem 

A  flowchart  schema  has  a  finite  number  of  variables.  We  use  the 
symbols  Yj.Yp.y  ,  ...  to  represent  variables.  The  schema  we  consider  has  a 
zero-ai  motion  a  (i.e.,  an  individual  constant),  a  unary  predicate  p, 

^ary  functions  f  and  h  and  the  binary  function  g.  Statements  in  the  schema 
are  of  uie  following  types: 

Start  statement:  START 

Halt  statement:  HALT(y) 

Assignment  statement:  y  -  a 

yt  -  f(yj) 
yi  -  -yk> 
y±  -  h(yj) 

Predicate  test:  if  p(y)  then  goto  L^  else  goto  L, 

where  and  L  are  arbitrary  labels.  Any  statement  may  be  labelled,  and 
unconditional  goto  statements  are  allowed.  While  the  scheme  is  called  a 
"flowchart  schema"  for  the  reason  that  it  can  be  represented  as  a  flowchart, 
we  use  the  more  compact  and  convenient  ALGOL-like  notation.  We  also  allow  the 
use  of  block  structures  and  "while-statements"  with  the  understanding  that 
these  features  may  be  translated  by  using  goto  statements  to  get  a  "legal" 
schema . 

A  flowchart  schema  with  arrays  has  the  following  additional  features 

C1’C2’C 6*  "  "  ‘  counters  that  can  have  nonnegative 

integer  values 

A1 ,A? ,AJ ’ * ’ ‘  one-dimensional,  semi-infinite  arraj s 

that  may  be  subscripted  with  counters. 

For  convenience  we  will  frequently  use  identifiers  other  than  those  given 
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above  for  counters,  variables  and  arrays.  For  this  reason  the  artifice  of 
declarations  can  be  used  to  clarify  the  meaning  of  identifiers,  where 
necessary.  The  type  data  is  used  to  identify  variables,  counter  for 
counters,  and  array  for  arrays.  The  following  operations  on  counters  and 
arrays  are  allowed: 

Counter  operations:  c  •-  c  +  1 

c  *-  c  -  1 

c  .  •-  c  . 

1  J 

if_  c  =  0  then  goto  else  goto 

Array  operations:  y  «-  A[c] 

A[c]  -  y 

The  execution  of  each  statement  is  assumed  to  take  a  unit  amount  of 
time.  The  space  required  is  the  number  of  variables,  counters  and  the 
number  of  array  locations  from  zero  to  the  maximum  ever  referenced. 

We  may  consider  the  use  of  additional  statements  that  may  be  assumed 
to  take  unit  amounts  of  time  each.  An  example  is  the  test  of  equality 
between  two  variables.  This  case  is  briefly  considered  by  Chandra  and  Manna 
[1()7:].  Other  examples  include  the  comparison  of  the  values  of  two  counters, 
halving  and  doubling  the  values  of  counters,  multiplying  two  counters  etc. 
While  these  operations  are  certainly  useful,  their  direct  application  in  the 
algorithms  to  be  described  results  only  in  a  factor  in  time  efficiency 
(though  equality  tests  can  also  detect  looping) . 

The  given  recursive  schema  is 

Compute  F(a)  where 

F(y)  -  if  P(y)  then  h(y)  else  g(y,F(f (y))) . 

The  problem  is  to  translate  this  into  an  efficient  equivalent  flowchart 
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schema  (with  and  without  arrays). 

For  any  interpretation,  if  the  ;  3  an  integer  n  such  that: 

P(fn(a))  is  true, 

and  Yk  <  u  p(f^(a))  is  false, 

then  the  term  to  be  computed  is 

g(a>g(f(a) >g(f (f(a)) ,  •••  g(fn_1(a) ,h(fn(a)))  ...  ))). 

If,  for  any  interpretation,  no  such  n  exists  then  the  schema  diverges  for 
that  interpretation.  The  time  and  space  bounds  will  be  considered  in  terms 
of  n . 

!'.?  Reduction  of  the  linear  recursion  problem  to  the  schema  problem 

A  linear  recursive  program  is  a  set  of  ALGOL-like  procedures 
having  the  following  features: 

(a)  There  are  no  global  variables.  This  imposes  no  constraints 
because  we  allow  procedures  to  return  a  vector  of  arguments. 

(b)  Each  procedure  is  loop-free.  Loops  can  be  eliminated  by 
using  recursive  calls  instead  (McCarthy  [1962]).  However, 
standard  techniques  exist  for  implementing  iteration;  and 
we  do  not  wish  to  complicate  matters  by  considering  this 
case  as  we  11 . 

(c)  Each  procedure  can  call  at  most  one  other  procedure.  This 
is  the  crucial  reason  why  the  program  is  called  "linear". 

Now,  given  a  linear  recursive  program  we  will  first  "erase"  the 
definitions  of  the  base  functions  and  predicates  to  get  a  linear  recursive 
schema.  The  reason  for  this  is  that  we  are  not  interested  in  the  detailed 
implementation  of  the  basic  operations  like  addition,  multiplication,  cons 
(in  LISP),  test  for  zero,  etc. 

5 
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A  linear  recursive  schema  is  a  set  of  non-looping  procedures. 

Ihe  first  statement  of  each  procedure  is  the  start  statement.  Subsequent 
statements  may  be  assignment,  test,  or  return  statements.  As  mentioned 

we  will  use  ALGOL-like  notation  to  represent  the  flow  of  control. 

The  variables  used  in  the  procedure  body  may  be  formal  parameters  (called 
by  value)  or  local  variables.  No  global  variables  are  allowed.  The  "type" 
of  a  variable  may  be  data  or  boo  lean  .  Data  variables  may  take  values 
from  some  domain  Dl  (to  be  specified  with  the  interpretation)  ,  whereas 
boolean  variables  take  values  from  the  domain  B  =  { true  ,  false)  .  Further, 
procedures  may  return  a  vector  of  values.  This  is  important  because  side 
effects  are  not  allowed.  The  same  effect  could  be  achieved  (very  inefficiently) 
by  calling  several  procedures  in  sequence  each  of  which  returns  a  single  value; 
but  even  this  mechanism  is  not  available  to  us  since  linear  recursive  schemas 
can  have  at  most  one  procedure  call  in  every  path  from  the  start  statement 
to  a  return  statement.  It  is  assumed  that  the  given  linear  recursive  schema 
has  no  illegal  reference,  i.e.,  no  variable  is  referenced  unless  it  has 
been  assigned  a  value.  This  assumption  is  not  strictly  required  for  we 
can  augment  the  domains  and  B  by  adding  one  "undefined  element"  to  each. 

This  represents  only  minor  modifications  of  the  discussion  below. 

The  given  linear  recursive  schema  is  to  be  reduced  to  the 
standard  form 

S;  Compute  F(a)  where 

F(y)  -  if  p (y )  then  h(y)  else  g(y ,F(f (y))) . 

The  reduction  to  the  standard  form  is  effected  in  two  steps: 

1.  If  the  given  recursive  schema  has  d  distinct  procedure 
definitions,  combine  them  into  one  procedure  by  adding  log^(d)  boolean 
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variables  which  are  tested  at  the  start  of  the  new  procedure.  Also,  the 
vector  of  arguments  and  the  vector  of  values  returned  are  padded  with  arbitrary 
constants  so  that  they  both  have  the  same  type,  say  D^S  x  BC ;  i.e.,  the 
vector  may  be  represented  as  <yL  ,y2  , .  .  .yg  .z^Zg  , . .  .zt>  where  all  the  y's 
are  of  type  data  and  all  the  z's  are  of  type  boolean.  The  single  procedure 
obtained  in  this  manner  will  be  called  Fg ,  and  the  schema  (the  procedure  F,, 
along  with  the  initial  values)  will  be  called  S  . 

.  Now,  given  the  schema  and  the  interpretation  I  for  its 
functions  and  predicates  (over  the  domain  D^)  it  is  our  objective  to  produce 
an  interpretation  I  (over  some  domain  D)  for  the  constant  a,  the  functions 
and  the  predicate  p  such  that  the  schema  in  standard  form  effectively 
computes  the  same  value  as  SQ .  It  is  a  requirement  that  the  time  taken  to 
compute  the  new  base  functions  a,f,g,h  and  the  predicate  p  be  dependent 
only  on  the  schema  and  not  on  the  number  of  recursive  calls  required  for 
its  computation  under  1^  (with  the  usual  assumption  that  the  base  functions 
and  predicates  in  1^  take  unit  time  to  compute) .  If  this  requirement  is 
satisfied  then  the  assumption  (in  the  schema  problem)  that  each  basic 

statement  takes  a  unit  (or  bounded)  amount  of  time,  is  justified. 

S  t 

The  domain  D  is  x  B,  i.e.,  each  element  in  the  new  domain 

is  a  vector  of  data  and  boolean  values. 

The  zero-ary  function  a  is  defined  to  be  that  element  of  D 
which  is  the  vector  corresponding  to  the  arguments  initially  supplied  to 
the  procedure  F,  in  the  schema  Sp  . 

The  predicate  p  is  defined  as  follows:  for  any  vector 
v  c  D,  v  • . .  ,yg,z1, .  . .  ,zt>,  if  Fg  (yf , . . .  ,yg  ,Zi , . . .  ,zt)  executes 

a  recursive  call  then  p(v)  is  false,  else  p(v)  is  true. 
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The  function  h  is  defined  as  follows:  for  any  vector 
v  e  D,  v  =  >,  if  p(v)  is  false  then  h(v)  is  arbitrary,  but  if 

p(v)  is  true  (i.e.,  there  is  no  recursive  call)  then  h(v)  is  the  vector 
returned  on  execution  of  F^ (y^ , . . .  ,z^) . 

The  function  f  is  defined  as  follows:  for  any  vector 
v  £  D,  v  =  <y^,...,zt>,  if  p(v)  is  true  then  f(v)  is  arbitrary,  else  it 
is  the  vector  argument  of  the  recursive  call  to  , 

The  function  g  is  defined  as  follows:  for  ary  v^,voe  D, 
v^  =  <y^,...,zt>,  if  p(v^)  is  true  then  g(v^,v^)  is  arbitrary,  else  it 
is  the  value  that  would  be  returned  by  F  (y z  )  if  v0  were  substituted 
for  the  value  of  the  one  recursive  call  executed. 

For  somewhat  greater  detail  the  reader  is  urged  to  examine  the 
example  presented  below. 

The  interesting  thing  about  the  translation  presented  above  is 
that  it  is  reversible.  Given  I  and  the  schema  S,  or  a  schema  equivalent 
to  it,  one  can  substitute  the  values  of  the  constant  a,  the  functions  f,g,h 
and  the  predicate  p  to  obtain  a  schema  equivalent  to  the  original  schema 
S^.  In  this  manner,  once  we  have  an  efficient,  non-recursive  flowchart 
program  equivalent  to  S ,  on  substitution  we  obtain  an  efficient  program 
equivalent  to  S^. 

It  should  be  noted,  however,  that  the  word  "efficient"  as  used 
above  denotes  only  the  order  of  space-time  dependence  on  n  -the  number  of 
recursive  calls.  Blind  substitution  for  the  constant  a,  the  functions  f.  ,g,h 
and  the  predicate  p  would  result  in  some  redundant  computations,  for  examp  e , 
in  a  computation  of  f(v)  immediately  following  p(v)  (see  the  example 
below).  A  practical  compiler  based  on  this  method  would  find  it  relatively 
simple,  however,  to  avoid  this  duplicated  effort. 
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The  given  linear  recursive  schema  has  the  following  base  functions 
a^  a  zero-ary  function  (an  indivi  ’ual  constant) , 
f^  a  binary  function, 

fg  a  ternary  function,  and 

p^,p  unary  predicates. 

The  schema  is: 

Sx :  Compute  yx  where  <y1,Y?>  -  F^^); 

F,.(y, ,y„)  -  START  data  y„ ;  boolean  z„  ; 

!  a  i  j  -  1 

yi  *■ 

il  Pi(vi) 

then  begin  <y7,zx>  -  Flb (yp ) ; 

—  Z1 

then  RETURN (y^) 

else  RETURN(f2(yi,y?,y  ))  ; 

end 

else  if  P2(>’2) 

then  RETURN (y0) 

else  begin  y^  -  F^y^yg); 

RETURN (y) ; 
end  .  ^ 

F„,  (y, )  <-  START  data  y..  ,  boolean  z,  ; 
lbwl'  -  J2  -  1 

il  P2  (yj) 

then  begin  <y2^1>  -  Flb(yi); 

RETURN (yp ,true) ; 

end 

else  begin  yn  -  Fla(y1»y1); 

RETURN(y^ , false) ; 

end 
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Stop  1:  a  boolean  variable  z,,  is  added.  z„  =  true  signifies  a  call 

Cl  C- 

to  F,  ,  z„  =  false  signifies  a  call  to  F,.  .  The  boolean  variable  z,  (below) 
la  ?  lb  1 

plays  the  role  of  z,  in  the  above  definitions  of  F,  and  F1t  .  A 

1  la  lb 

"redundant"  data  variable  is  added  to  match  the  padded  data  element  in 
the  vector  returned.  Both  the  argument  vector  and  the  return  vector  have 
type  x  x  B.  The  resulting  schema  Sn  is: 

S,:  Compute  y^.  where  ^.y^.zp  -  F2  ^  ,a  ,  false)  : 


(1)  -  "  - 

(2)  -  -  - 

(3)  -  -  - 


(H)  -  -  - 

(5)  - 

(6)  - 

(7) - 


START  data  y^  ,y,  ;  boolean  z,  ; 

-  ^  I 

il 

then  bggin  -  fl^yl’yy); 

il  Pi(yi) 

then  begin  <y^,y1+,z1>  -  F  (yp  .false)  ; 

il  z1 

then  RETURN (y,a1, true) 

else  RETURN (f^ (y  y  ,y  )  ,a. ,true)  ; 

d  9  <-  J  -L 

end 

else  if  p,,  (y0) 

then  RETURN (y,,  ,a^  ,  true) 

else  begin  <y.,,y^,z1>  -  F,  (yL  ,y?  ,  true)  ; 

I  ETURN(y^ ,a^ ,true) ; 

end ; 

end 

else  begin  if  p  (y^ 

then  begin  <y2  ,y^  .zp  -  F  ,  (y}  ,&l ,  false)  ; 
RETURN(y, ,a^ ,true) ; 

end 
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else  begin  <yg  ,y^  ,z^>  -  F^j^  ,  ue) 

RETURN (y^  ,a  ,falie)  ; 

end 

(8)  -  -  -  end . 

Lines  (1)  -  (5)  effectively  define  F^a,  and  lines  (6)  -  (8)  define 
•  Line  (2)  invokes  a  call  to  F^*  Since  F  takes  only  one  argument, 
the  value  a^  is  padded.  Line  (4)  is  a  call  to  F^.  F^a  really  returns 
just  one  value.  So  return  statements  such  as  in  line  (3)  are  padded  with 
two  elements:  a^  and  true.  F^  returns  2  values,  so  only  one  value  needs 
to  be  padded  when  F^  returns:  e.g.  a^  in  line  (7)  is  the  padded  element. 

Step  2 :  the  standard  schema  is: 

S:  Compute  F(a)  where 

F(y)  -  ii  p(y)  then  h(y)  else  g(y,F(f(y))) . 

The  required  interpretation  I  for  S  follows.  The  definitions  of  p,h  and  g 
below  can  be  simplified.  We  choose  not  to  do  so  for  the  sake  of  clarity. 

D  =  Dj,  x  x  B 
a  =  <a^ ,a^ , false> 

p(<y1»y2,z2>)  =  if  z2 

then  if  P1(f1(y1,y2)) 
then  false 
else  if  P2(y2) 
then  true 
else  false 
else  if  p2(y1) 
then  false 
else  false 
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h(<y1,y^,z2>)  =  if  z2 

then  if  P1(f1(y1.y2)) 
then  arbitrary 
else  if  P2(y2) 

then  <y2 , a^ , true> 
else  arbitrary 
else  if  p2(yL) 

then  arbitrary 
else  arbitrary 

f (<yl»y?5z2>)  =  if  z2 

then  if  P1(f1(y1»y2)) 
then  <y2 ,a^ , false> 
else  if  P2(y2) 

then  arbitrary 
else  <f1(y1,y2),y2,true> 
else  if  Pp(y1) 

then  <y^ ,a  , false> 
else  <y^,y^,true> 

g  (<yx  >yp  ,z2>,  <wL  ,w2  .x^)  = 

if  Z2 

then  if  P1(f1(y1>y2)) 
then  if  x^ 

then  <w^,a^,true> 
else  <f2(f1(y1»y2)  »y2>w1),a1,true) 
else  if  P2(y2) 

then  arbitrary 
else  <w^,a^,true> 
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3 •  Solutions  for  the  Schema  Problem 
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3 . 1  Introduct ion 

In  the  previous  sections  it  was  shown  that  the  linear  recursion 
problem  can  be  reduced  to  the  schema  problem.  Appendix  I  presents  two 
other  problems  that  can  be  converted  to  the  schema  problem.  Algorithms 
for  this  problem  are  now  described,  and  the  other  problems  can  be  solved 
by  substituting  the  appropriate  interpretations  for  the  functions  and 
the  predicate  of  the  schema. 

The  schema  is: 

Computer  F(a)  where 

F(y)  -  ii  p(y)  then  h(y)  else  g(y ,F( f (y) ) ) . 

Let  n  be  the  depth  of  recursion  i.e.,  n  is  the  smallest  integer 
for  which  p(f  (a))  is  true.  Then  the  computation  may  be  represented  by 
the  following: 

Define  t(n)  to  be  h(fn(a)). 

Vi<n  define  t(i)  to  be  g(  f^a)  ,t(i+l)  )  . 

Then  the  desired  output  is  t(0)  -  assuming,  of  course,  that  n  exists, 

for  if  n  does  not  exist  then  the  schema  loops  forever.  Essentially  this 
rule  is  used  in  all  algorithms  below.  They  differ  only  in  the  way  they 
compute  f1(a). 

The  standard  implementation  uses  a  stack: 

START 

counter  c;  array  A; 

c  ♦—  0  5 

x  •-  a; 
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-  -  -  while  -i  p(x)  do 


begin 


A[c]  ~  x; 

X  -  f(x)  ; 

c  *-  c  +  1 ; 


y  -  h(x) ; 
while  c  f  0  do 


begin 


c  c  -  1 ; 
y  -  g(A[c] ,y) ; 


HALT ( y ) . 


The  array  A  acts  as  push-down  stack.  The  first  loop  (lines  (l)  -  (2)) 
implements  recursive  calls,  and  the  second  loop  (lines  (3)  -  (4))  pops  the 
stack  to  compute  the  final  value. 

This  implementation  takes  time  and  space  both  proportional  to 
n  -  the  number  of  recursive  calls. 

]t  is  also  well  known  that  the  recursion  can  be  implemented  using 

O 

only  a  constant  amount  of  memory  and  time  proportional  to  n  (see  for 
example  Garland  and  Luckham  [1971])-  One  program  that  avoids  even  the  use 
of  counters  is  presented  below.  In  this  program  the  variables  w,  and  w 
effectively  play  the  role  of  counters.  Counters  are  implemented  by  letting 
the  term  f1  a)  (i  <  i)  represent  the  value  n  -  i.  A  counter,  e.g.,  w^ , 
can  be  set  to  its  maximum  value  n  by  an  assignment  statement  w^  *-  a,  the 
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counter  can  be  decremented  by  W;L  -  f^),  and  it  can  be  tested  for  zero  by 
P(w1)  • 

START 
x  *-  a> 

while  -i  p (x)  do  x  -  f(x)  ; 
y  -  h(x) ; 
wx  a; 

while  — ,  p(w^)  do 
begin 

w!  -  f(w1) ; 
x  <-  a; 
w?  -  wx; 

while  -i  p(wp)  do. 
begin 

x  »-  f(x) ; 

W2  *"  f(w2); 

end ; 

y  -  g(x,y) ; 

end ; 

HALT(y) . 

It  is  shown  in  this  paper  that  with  a  constant  amount  of  memory 
•  1  1 

the  time  can  be  brought  down  to  n  for  any  arbitrarily  small  positive  e. 

This  answers  in  the  negative,  a  conjecture  due  to  Hewitt  [ 1970]  that  n  is 
the  best  one  can  hope  to  do.  Further,  to  solve  the  problem  in  tune  proportional 

to  n  one  does  not  need  space  proportional  to  n  as  in  the  stack  implementation, 

e 

just  n  .  It  is  also  satisfying  that  there  exists  an  algorithm  whose 
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space-time  tradeoff  falls  'Midway"  between  these  two  algorithms;  it  takes 
space  proportional  to  log(n)  and  time  proportional  to  n.log(n). 


Time 

Space 

n 

n 

stack  implementation 

n 

e 

n 

linear-time  algorithm  (sec. 

3-2) 

n  log  n 

log  n 

log(n)  algorithm 

(sec . 

3-3) 

l+€ 

n 

1 

constant -space 
algorithm 

(sec. 

3-4) 

P 

n 

1 

conventional  constant 
program 

space 

')  .2  Linear-time  algorithm 

The  main  problem  encountered  in  flowcharting  the  given  recursive 
schema  is  that  of  inverting  the  function  f  i.e.,  given  a  value  fi+^(a), 
to  find  f1(a).  This  cannot  be  done  directly  because  the  function  f  is 
not  invertible.  The  stack  implementation  solves  this  problem  by  storing  all 
the  values 

a,  f(a),  t  (a),  ...,  fn(a), 


and  picking  them  off  in  reverse  order.  The  constant-space  algorithm  above, 
on  the  other  hand,  doesn't  really  save  any  values  as  such,  but  rather  computes 

a>  i(a)>  f  (a),-*-af  (a),f  (a).  A  control  mechanism  is  used  to  keep  the 

count . 


Between  these  two  extremes  of  saving  all  values,  and  saving  none, 
there  exist  schemes  for  saving  an  intermediate  number  of  carefully  spaced 


values.  We  first  informally  describe  an  algorithm  that  is  linear  in  n 
but  requires  space  proportional  to  n1/2  (order-2  algorithm).  We  then 
present  a  generalization  of  this  algorithm  which  takes  space  nl/k  (order-lf 
algorithm).  Details  are  given  in  sec.  6.1  of  Appendix  II. 

Order-2  algorithm 

We  use  the  notation  v(i)  to  stand  for  the  term  f1(a).  Thus  v(o) 
stands  for  the  constant  a  itself. 

Let  n  be  a  perfect  square  and  let  m  denote  /nT  In  the 
initialization  phase  the  following  values  are  calculated  and  saved: 
v(0),  v(m),  v(2m) , . . .  ,v((m-2)*m)  , 

v(  (m-1)  *m)  ,  v(  (m-l)  *m+i)  , .  .  .  ,v((m-l)  *m+m-l) . 

Now  the  first  m  values  (right  to  left)  can  be  picked  off  and  used  in  the 
computation  of  the  final  output.  Each  step  takes  a  constant  amount  of  time. 
The  following  values  are  left: 

v (0)  ,  v(m),  v(2m)  ,. . .  ,v((m-2)*m)  . 

A  redistribution  can  now  be  performed  to  compute  and  save  the  values 
v ( (m-2)  *m+l)  ,  v(  (m-2)  *m+2)  , . . .  ,  v(  (m-2 ) *m+m-l)  . 

Another  set  of  m  values  can  now  be  picked  off  before  a  second  distribution 
is  required;  and  so  on. 

The  following  are  the  contributions  to  the  computation  time: 

1.  The  first  initialization  phase  -  takes  time  proportional  to  n. 

2.  Picking  off  values  -  n  steps,  taking  constant  time  each  - 
total  proportional  to  n. 

7).  Redistribution  -  /n~  steps,  each  taking  time  proportional 

to  /n  -  total  proportional  to  n. 

Thus  the  overall  algorithm  is  linear,  and  takes  space  2  *  Jn~  .  We 


call  the  above  an  order-2  linear-time  algorithm.  It  can  be  generalized 
to  an  order-k  linear-time  algorithm  as  follows. 

Order-k  algorithm 

Let  n  >  0  be  a  power  of  k,  and  let  m  denote  n1^. 

,l)  The  initialization  phase:  compute  and  save  the  values 
v(0),  v(n-(1n-l).11>k-1)  ,  v^-^-2)./-1) . v(n-2mk-1), 

v(nn.nk"-)  ,  v(n-(m-l).mk"2) . v'n-2mk"2)  , 

v(n-m) ,  v  (n-  (m-1) )  ,  .  ..,  v(n-2),  v(n-i), 

Set  counters  ^  -  c?  -  . . .  -  Cr  m-1. 

Set  y  -  h (v (n) )  . 

(  1  )  The  main  computation  phase:  "pick  off”  the  last  m  values,  i.e., 

for  m  steps  do  y  -  g(x,y),  where  x  takes  on  the  saved  values 
from  right  to  left. 

(;’)  Redistribution  phase: 

0*1)  Level  1  redistribution: 

If  c^=  0  then  goto  step  J,.2. 

*  $ 
Redistribute  m-1  new  values  by  steps  of  1  using  the  latest 

saved  value. 

ci  -  ci  -1- 
Goto  step  2 . 

(j  •?)  Level  2  redistribution: 

If  cp  =  0  then  goto  step  3.3. 

Redistribute  m-1  new  values  by  steps  of  m  using  the 
latest  saved  value. 


Goto  step  3.I. 
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(3-3)  Level  3  redistribution: 


I 


I 


%  (3-k- 


* 


> 

00 


► 


i 


If  c  =  0  then  goto  step  3.4. 

2 

Redistribute  m-1  new  values  by  steps  of  m  using  the  latest 
saved  value. 

c3  -  c3  '  1 
C2  *“  m  * 

Goto  step  3*2 . 


1)  Level  k-1  redistribution 
If  0  then  goto  step  4. 

k“2 

Redistribute  m-1  new  values  by  steps  of  m  using  the  latest 
saved  value. 

ck-i  -Vi  -1- 
Ck-2 

Goto  step  3.k-2. 


HALT(y) 
v (o)  ... 


Figure  1 
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Figure.  1  shows  the  values  saved  on  the  initialization  phase. 

Figure  shows  the  computations  in  somewhat  greater  detail  for  the  order-1, 
the  order-1,  and  the  order-4  algorithms  with  n  =  16.  In  Figure  2,  adjacent 
squares  represent  values  a ,  f  (a)  , .  . .  ,  f  ^  (a) .  Squares  marked  X  represent 
values  that  have  already  beeii  used  in  building  up  the  final  output.  Up-arrows  t 
denote  values  that  are  saved  at  that  stage  of  the  algorithm.  Note  that 
the  order-1  algorithm  is  precisely  the  conventional  stack  implementation  of 
recursion . 

1  1  2  -j  4  16 


t  t  t  t  r  t  t  t  t  t  t  t  t  t  t  t 
Order-]  linear-time  algorithm 


All  values  are  saved 


Figure  a 


f  f  t  t  After  initialization 


XXX  XX, 


t  T  t  T 


After  first  redistribution 


X  X  X  X  XXXI  x  xl 


t  t  T  t 


After  second  redistribution 


X  ,X  X  X  X  X  X  I  X  X  X  X'  X  X 


t  t  t  T 


After  last  redistribution 


Order-2  linear-time  algorithm 


Figure  2b 
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After  first  level-1  ied^st. 


After  first  level-2  redist. 
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Order-^  linear-time  algorithm 


After  first  level-3  redist. 


After  subsequent  level -2 
and  level-1  redistributions. 


Figure  2c 


LL&ure  J. 


The  discussion  above  is  a  simplified  version  ignoring  the 
important  case  when  n  is  not  the  k-th  power  of  any  integer.  The 
total  algorithm  is  described  in  Appendix  II.  The  main  differences  from 
the  simplified  algorithm  are: 

(1)  the  initialization  phase  is  somewhat  more  complex, 

(2)  level-k  redistributions  too  are  called  for. 
Nevertheless,  the  space-time  considerations  below  remain  valid. 


Space  requirements 

The  maximum  number  of  saved  values  in  the  simplified  algorithm 
1/k 

is  (k-l).n  +1.  For  the  general  algorithm  the  number  of  saved  values  is 

at  most 


(k-l) .  |_n1//k  j  + 


A  small  amount  of  extra  storage  is  required  for  counters  (proportional  to  k) 
and  some  variables  for  manipulating  values  (constant  number)  . 

Thus,  asymptotically,  the  data-space  requirement  is  proportional 

1/k 


to 


k.  n 


1 


Time  requirements 


following : 


The  main  components  contributing  to  the  running  time  are  the 

1.  The  initialization  phase:  proportional  to  n. 

2.  The  computation  phase:  n  steps,  each  taking  constant 

time  -  total  proportional  to  n. 

1-l/k 

3.  Level  1  redistribution:  n  '  steps,  each  taking  time 

l/k 

n  '  -  total  proportional  to  n. 

b.  Level  2  redistribution:  n*'  c  ^ 


steps,  each  taking  time 


n 


?/k  - 


total  proportional  to  n. 


k+1.  Level  k-1  redistribution: 
total  proportional  to  n. 

Thus  the  total  running  time  is  proportional  to 


steps,  each  taking  time  ri 


1-l/k 


k.n. 


In  summary 

There  exist  linear-time  algorithms  that  take  space  significantly 

l/k 

less  than  the  conventional  stack  algorithm  -  merely  n  '  where  k  is  the 
order  of  the  algorithm.  It  is  interesting  to  note  that  the  constant  of 
proportionality  increases  linearly  with  k.  The  running  time  too  increases 
linearly  with  k.  It  is  for  this  reason  (i.e.  the  constants  of  proportionality 
do  not  increase  too  rapidly  with  k)  that  high-order  algorithms  can  be 
appealing  in  practice. 


r 
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The  logfn)  algorithm 


3  •  ') 

Before  we  go  on  to  describe  a  class  of  constant -space 
algorithms  we  mention  an  algorithm  lying  between  the  linear-time  and 
the  constant -space  algorithms.  It  can  be  approximately  described  in  terms 
of  the  class  of  linear-time  algorithms  as  follows:  if  n  is  the  depth  of 
recursion  and  p  =  log2 (n)  then  execute  the  order-p  linear-time  algorithm. 

The  idea  behind  the  method  is  the  following.  We  convert  the 
given  linear  recursive  program  into  a  "nonlinear"  recursive  program  which 
is  then  implemented  by  the  usual  stack  methoi.  T  '"is  significant 

savings  in  space  at  the  expense  of  extra  carputat*  e  algorithm  is 

given  below.  As  before  we  use  the  notation  v^i)  to  represci  f^a). 

Main  program: 

(1)  Compute  n,  v(n) . 

(2  )  Set  y  »-  h(v(n) )  . 

(j5)  y  -  G(v(0)  ,  n,  y)  . 

(h)  HALT  (y)  . 

Recursive  procedure  G(x,i,y): 

(local  counter  j ) , 

(1)  If  i  =  0  then  RETURN(y) . 

(2)  If  i  =  1  then  RETURN(g (x,y) ) . 

[$)  Set  j  -  i/2  (integer  division)  . 

(h)  y  -  G(fj(x),  i-j ,y) . 

(5)  y  -  G(x,  j ,  y)  . 

(6)  RETURN (y). 

The  procedure  G  works  by  dividing  the  given  "interval"  into 
two  parts,  and  calling  itself  recursively  on  the  second  half,  and  then  on 


the  first  half  of  the  interval.  The  algorithm  takes  space  proportional  to 
log  n)  and  time  proportional  to  n.log(n). 

3 -4  Constant -space  algorithm 

We  would  like  to  implement  constant -space  algorithms  using  only 
a  finite  control,  i.e.  we  do  not  wish  to  use  arrays  or  counters.  It  has  been 
shown  (sec.  3.1)  however,  that  bounded  counters  in  the  range  0-n  can  be 
implemented  without  the  use  of  an  explicit  counter.  We  will  thus  allow 
ourselves  the  liberty  of  using  bounded  counters.  We  have  to  be  a  little 
careful  because  incrementing  a  counter  is  no  longer  a  unit  operation,  but 

makes  time  proportional  to  n.  Decrementing  a  counter  and  testing  for  zero, 
however,  remain  unit  operations. 

As  before,  we  will  first  informally  describe  the  order-2 
algorithm  and  then  generate  to  the  higher  order  case,  leaving  the 
details  for  sec.  6.2  of  Appendix  II. 

Order-?  algorithm 

Let  n  be  a  perfect  square  and  let  m  denote  /h  .  In  the 
initialization  phase  n,m  are  evaluated  and  two  values  are  saved  -  v(o) 
and  v(n-m) .  The  latter  is  now  used  as  the  base  for  computing  v(n-l) , 
v(n- '),...,  v(n-m)  in  that  order.  The  advantage  obta  led  by  using  v(n-m) 
base  as  against  v.O)  is  that  the  average  computation  time  for  each 
term  is  only  /„  instead  of  n.  Now  after  the  m  values  have  been  evaluated, 
it  is  time  to  reset  the  base  to  v(n-2m).  This,  of  course,  takes  time  n, 
but  then  these  resets  have  to  be  done  quite  infrequently  -  /„  times.  The 
main  contributors  to  the  computation  time  are  (1)  the  initialization  which 
takes  time  n  ,  (?)  the  actual  computation:  n  steps  taking  n1/2  average  - 

total  n-  ,  and  (3)  resets  of  the  base  value  -  n1?  steps  averaging  „  . 

total  „  .  Thus  the  total  computation  time  is  just  when  2  values 

are  saved . 
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Order 


let 


-k  algorithm 

Generalizing  this  to  an  order-k  constant-space  algorithm 

1  0  be  a  power  of  k,  and  let  m  denote  n1//k. 

1)  The  initialization  phase:  compute  and  save  the  values 

Xk-1  -  v(°)> 

/  k-1. 

Xk-2  *"  v(n"m  )» 


*0  *-  v(n-m)  . 


*-  ck-l  *“ 


Set  counters  c^  »-  c0  «- 
Set  y  h(v(n)) . 

(2)  The  main  computation  phase:  "pick  off"  the  m  values  to  the 
right  of  x o  (including  xQ  itself)  using  :<0  as  the  base, 
and  apply  to  y. 

(3)  The  reset  phase: 

(3-1)  Level  1  reset  ; 

If  c1  =  then  goto  step  3*2. 

Reset  x0  to  a  position  m  steps  to  the  "left"  using  x.^  c 
the  base. 

ci  -  vl- 

Goto  step  2 . 


(3-2)  Level  2  reset: 

If  C2  =  0  then  goto  step  3*3* 

2 

Reset  x^  to  a  position  m‘  steps  to  the  left  using  x 
as  the  base. 

C?-C 2  '  1‘ 
c  -  m 


Goto  step  3.1. 


(•'.k-l)  Level  k-1  reset: 

If  ck_M  =  0  then  goto  step  4. 

k-l 

Reset  x  to  a  position  m  "  steps  to  the  left  using 

K  "c. 

x.  ,  as  the  base, 
k-l 

Ck-1  *"  Ck-1  _1* 

Ck-2 

Goto  step  3.k-2 . 

(4)  HALT(y). 


Figure  3  demonstrates  the  order-1,  order-2  and  order-4  algorithms 
n  =  16 .  Note  that  the  order-1  algorithm  is  precisely  the  conventional 
constant  space  algorithm. 
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Initialization 


After  first  level-1  reset 


After  first  level-2  reset 


After  first  level-3  reset 


After  subsequent  level-2  and 
level-1  resets 


Order-4  constant-space  algorithm 
Fig-  3c 


Figure  3 


The  above  description  deals  with  the  simplified  case  where  n 
is  a  k-th  power.  The  general  case  is  given  in  Section  6.2  of  Appendix  II, 
and  differs  from  the  above  only  in  technicalities. 

Space  requirements 

The  algorithm  saves  k  values.  Strictly,  v(o)  does  not:  have 
to  be  saved  as  it  is  simply  the  constant  a.  In  addition,  there  are  a  fixed 
number  of  bounded  counters  and  additional  variables  for  manipulation  of 
values.  Thus  the  data-space  is  constant  (with  respect  to  n) ,  and 
proportional  to  k.  The  size  of  the  program  (the  number  of  states  of  the 
automaton  for  the  automaton  problem  -  see  Section  5-1)  grows  linearly  with  k 
Time  requirements 

The  running  time  can  be  divided  into 


1. 

2  . 


total  n 


1+l/k 


The  initialization  phase: 
The  computation  phase:  n 


proportional  to 
steps  averaging 


n 


1+l/k 


each  - 


3- 

4. 


Level  1  resets: 
Level  2  resets: 


2-£/k 


averaging  n 
averaging  n 


2/k 

3/k 


each  -  total  n^+^^. 
each  -  total 


k+1.  Level  k-1  resets:  n1//k  averaging  time  n-total  n1+1/kt 

Thus  the  total  running  time  grows  as  nl+l/k  and  the  constant 
of  proportionality  is  linear  with  k. 


In  summary 

Given  a  fixed  amount  of  space  one  can  do  significantly  better 
than  n  ;  in  fact  the  running  time  can  be  made  n1+1/k  for  arbitrarily 
large  k.  Storage  space  grows  linearly  with  k,  as  does  the  complexity  (size) 
of  the  program  (or  finite  state  automaton). 

Constant  space  algorithms  can  be  quite  attractive  because  all 
values  used  in  the  computation  could  be  stored  in  the  registers  of  a 
computer,  and  in  any  case  the  addressing  is  easier  than  the  log(n)  and  the 
linear-time  algorithms. 

It  is  fascinating  to  note  that  if  we  let  p  represent  log-,(n) 
then  the  effect  of  the  order-p  linear-time,  the  order-p  constant -space ,  and 
the  log(n)  algorithms  is  (approximately)  the  same  with  regard  to  the  running 
time  and  storage  requirements. 

The  relationships  between  the  algorithms  described  above  is 
shown  in  Figure  4 . 
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4 .  Conclusions 
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Solving  a  problem  by  first  converting  it  into  a  problem-schema  is  an 
interesting  concept  that  merits  a  great  deal  more  study.  The  advantage 
gained  is  that  the  solution  of  the  schema  can  be  used  to  solve  several  seemingly 
unrelated  problems.  An  associated  advantage  is  that  conversion  to  a  schema 
usually  helps  to  formalize  the  problem  too.  An  example  of  this  is  the 
delineation  of  the  kind  of  statements  allowed  in  a  schema.  Because  of  this, 
however,  some  care  has  to  be  exercised  when  optimal  solutions  are  required 
because  in  this  case  conversion  to  a  schema  requires  more  stringent  conditions: 
for  each  construct  in  the  schema  there  should  exist  a  corresponding  base 
problem  construct,  and  vice  versa.  Also,  simple  changes  in  the  ground  rules 
of  the  base  problem  can  significantly  alter  the  corresponding  schema  problem. 

It  was  not  our  objective  in  this  paper  to  give  optimal  solutions,  just 
to  give  good  solutions  and  observe  the  space-time  tradeoffs  one  can  expect. 

It  nay  have  been  obvious  to  the  reader  that  the  constant -space  algorithms, 
for  example,  are  not  optimal.  The  number  of  base  operations  required  (other 
than  the  control  mechanism)  for  the  order -k  constant -space  algorithm  is 

_k_  .  „1+1A 

2 

whereas 

k  :1//R  n1+1/k 

1+1/k 

is  feasible  with  the  same  space,  representing  a  6$  improvement  for  the 
order-  algorithm,  9$  for  order-3  and  18#  for  order-10;  and  even  in  the 
limit  our  simple  algorithm  does  not  become  arbitrarily  bad  compared  with 
the  other.  The  price  paid  for  the  improvement  is  the  somewhat  greater 

complexity  of  the  contr-'l  structure,  and  an  increased  number  of  counter 
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operations  (which  were  neglected) . 

It  is  reasonable  to  ask  whether  the  algorithms  described  can  have  any 
real  practical  significance.  Exact  machine  times  for  the  algorithms  are 
difficult  to  evaluate  owing  to  machine  dependent  questions  like  register 
allocation,  indirect  addressing  machinery,  cache  allocation,  parallelism  and 
swapping  (in  a  time  shared  system).  We  can  approximate  times,  however,  by 
using  reasonable  assumptions.  In  the  program  we  assume  that  each  of  the  base 
routines  (the  functions  f,g  and  h,  and  the  predicate  p)  takes  40  micro¬ 
seconds  to  evaluate  and  that  operations  on  counters  entail  negligible  cost. 
Storing  a  value  (all  data  variables)  is  assumed  to  take  4  machine  words, 
and  64K  words  of  ore  are  available  to  the  user.  The  following  table  gives 
the  running  times  for  the  various  algorithms  for  recursion  depths  of  16K, 

64k,  6 56k,  and  ' M. 


n 

k=l 

Linear  -  time 
k=2 

k=6 

Log(n) 

k=6 

Const-space 

k=2 

k=l 

16K 

1.9?  sec 

>?7  sec 

5.7I  sec 

6.55  sec 

10.4 

sec 

44  .2  sec 

89*5  min 

64  K 

impos  . 

13 . 1  sec 

22 .8  sec 

28.8  sec 

^7-5 

sec 

5  .75  min 

59-7  hr 

26CK 

impos . 

52 .4  sec 

86.5  sec 

126  sec 

3-67 

min 

45  *4  min 

» 

20  days 

1M 

impos . 

3.49  min 

6.5I  min 

9.O9  min 

21.3 

min 

15  hr 

1  year 

It  is  clearly  indicated  that  for  large  recursion  depths  the  stack 
implementation  is  not  attractive;  and  for  very  large  recursion  depths  even 
the  order-6  linear-time  algorithm  would  approach  the  memory  capacities  if 
implemented  on  present  day  minicomputers  (8K  words  required  for  1M  recursion 
depth)  .  And  finally  it  should  be  pointed  out  that  even  for  relatively  small 
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recursion  depths  higher-order  linear-ti..ie  (and  even  some  constant-space) 
algorithms  may  be  preferred  as  background  jobs  in  a  time  sharing  system 
because  they  need  not  be  swapped  out  as  their  core  requirements  are  quite 
nominal . 

In  the  preceding  discussion  the  model  of  computation  assumes  that  the 
size  of  the  data  structure  remains  bounded  as  computation  proceeds.  Often, 
it  is  more  reasonable  to  assume  that  the  size  of  the  data  increases  with 
the  depth  of  recursion,  as  does  the  time  for  a  unit  operation  on  the  data. 

The  algorithms  presented  in  this  paper  retain  their  significance  under  these 
conditions,  and  if  anything,  beccme  more  useful  vis  a  vis  the  stack  algorithm 
because  space  restrictions  become  more  severe.  For  example,  if  both  time 
for  a  unit  operation  and  the  size  of  the  data  structure  increase  linearly 

with  the  depth  of  recursion,  the  stack  implementation  would  take  space  and 

') 

time  n  whereas  the  so  called  "linear-time"  algorithm  would  take  space 
1+l/k  2 

k.n  '  and  time  k.n“,  and  the  "constant -space"  algorithm  would  take 
space  k.n  and  time  k.n^^k. 


> 


5  .  Appendix  I 


'j  .  1  The  automaton  problem 

What  is  the  time  required  for  a  finite  automaton,  w,  - 

rr  an 

arbitrary  number  of  reading  heads,  to  output  the  symbols  on  its  input 
tape  from  right  to  left?  The  heads  can  only  read  from  left  to  right,  but 
the  automaton  has  the  capability  of  taking  some  reading  head  and  setting 
it  to  the  same  position  on  the  input  tape  as  some  other  head  (note  that  an 
automaton  without  this  capability  cannot  even  perform  the  given  task  for 
arbitrarily  long  input  tapes). 

To  reduce  the  automaton  problem  to  the  schema  problem  the 
following  correspondence  between  the  finite  state  automaton  and  the  schema 
may  be  set  up: 

Finite  state  automaton  Schema 

Head  i 

Move  read  i  to  the  right 


variable  y.. 
>ri  -  f(yi) 


Set  head  i  to  the  same  position 
as  head  j 

Test  if  head  i  is  on  the  last 
character  of  the  tape 

The  output  file 

Output  the  first  character  from 
head  i 

Add  to  the  output  file  from 
head  i 


yi  -  yj 

p(yi) 

a  special  variable  y 

y  -  h(yt) 


y  -  g(yi»y) 

On  comparison  with  the  recursive  schema 
Compute  F(a)  where 

F(x)  if^  p(x)  then  h(x)  else  g(x,F(f(x))) 


¥ 
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we  see  that  if  x  represents  a  square  on  the  input  tape  then  F(x) 
represents  the  value  of  the  output  file  with  all  characters  on  the  right 
of  x  (and  including  it)  written  in  reverse  order.  This  is  obtained  by 
first  writing  all  characters  on  the  right  of  x,  i.e.  F(f(x)),  and  then 
appending  x  to  it,  i.e.  g(x ,F(f (x) ) )  . 

Thus  the  automaton  problem  is  reduced  to  the  schema  problem 
(without  arrays),  but  with  the  constraint  that  the  functions  h  and  g 
can  be  used  only  in  conjunction  with  the  special  variable  y  as  in  the 
statements  described,  and  that  the  statement  y  h(y^)  cannot  be 
executed  more  than  once. 

The  reduction  is  one-way  i.e.  a  solution  of  the  schema  problem 
(with  the  constraints)  gives  a  solution  of  the  automaton  problem.  Of 
course,  the  automaton  may  do  fancy  things  e.g.,  it  may  check  if  its 
entire  tape  contains  just  one  character,  repeated  over  and  over  again,  and 
in  this  special  case  it  could  produce  its  output  in  time  2n.  However,  the 
flowchart  schema  cannot  do  this  as  equality  tests  are  not  allowed. 

It  may  be  argued  that  the  variable  y  requires  not  just  a 
unit  amount  of  space  but  space  proportional  to  n.  However,  since  the 
finite  state  automaton  is  not  expected  to  remember  the  contents  of  its 
output  file  we  may  consider  that  y  takes  zero  space.  Hence  the 
assumption  that  all  variables  take  unit  space  gives  a  value  for  memory 
requirement  one  greater  than  the  number  of  heads  required  by  the  automaton. 

5 .2  The  list  problem 

Given  a  one  way  list,  to  output  the  elements  of  the  list  in 
reverse  order.  We  are  not  allowed  to  change  the  pointers  of  the  list 
itself  as  in  the  case  where  the  list  structure  is  common  to  several 
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concurrent  processes;  and  we  ask  what  are  the  time -memo --y  tradeoffs. 

0 - >  0  - $>  0  - >  0  ...  - 5>  0 

This  problem  is  a  generalization  of  the  automaton  problem  because  our 
random-access  computer  has  several  features  not  available  to  the  finite 
^  automaton;  the  number  of  pointers  into  the  list  structure  can  vary  with  the 

size  of  the  given  list,  two  pointers  can  be  tested  to  see  if  they  happen 
to  point  to  the  same  node,  etc.  In  the  special  case  where  we  restrict 
our  computer  to  have  the  capability  of  a  finite  automaton  we  obtain  the 
automaton  problem. 

The  reduction  of  the  list  problem  to  the  schema  problem  is 
analogous  to  the  reduction  of  the  automaton  problem,  except  that  in  this 
case  counters  and  arrays  are  allowed.  Arrays  can  be  used  to  hold  pointers 
into  the  list.  Pointers  are  analogous  to  the  heads  of  the  automaton.  However, 
as  the  arrays  are  semi-infinite,  the  number  of  pointers  can  increase  with 
the  size  of  the  list  structure.  As  in  Section  5.1,  there  is  a  special 
variable  y  representing  the  output  file  and  the  only  operations  allowed 

* 

on  y  are  y  -  h^),  and  y  -  g(yi,y). 

* 
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Appendix  II 

6 . 1  The  Linear-Time  Algorithm 
START 

counter  i.m.n.c  . c.  .d  . d  : 

-  0  k’  0  k’ 

data  x,y; 
array  A^...^; 

STEP1 :  if  p (a)  then  HALT (h (a)) ; 
n  «-  0;  x  *-  a; 

—  while  -ip  fa)  do  begin  x  *-  f(x);  n  *-  n  +  1  end ; 

y  -  h(x) ; 

1/k. 


m  ♦—  n 


d0  -  1;  di*"  m ;  d2~  m"  5  •  •  *  ;  V 

f  •-  n;  x  *-  a; 


cQ  -  cx - ck  *"  0’ 


m 


AjJ  0]  *-  x  ; 


while  l  >  d.  do 
— -  k  — 


hesili  f  -  i  -  dk;  X  ~  fdk(x);  cR  -  ck  +  1;  A^ 


AQ[0]  -  x; 
while  l  >  d^  do 

begin  f  *-  *  -  dQ;  x  -  fd°(x);  cQ  *-  cQ  +  1;  A[cQ] 
STEP2:  y  -  g(A[c0]  ,y)  ; 

if.  cq  =  0  then  goto  STEPJ; 


goto  STEP2 ; 


STEP_J : 


STEP3.I:  if  c1  =  0  then  goto  STEP3 .2 ; 

ci  <-  ci  ~  l; 

Aq[0]  -  X  -  A.J  c^  ; 

for  Cq  •-  1  step  1  until  m-1  do  A^[  c^J  •-  x  •-  f^O(x)  ; 
goto  STEP2 ; 


STEPJ.k:  i_f  c^=  0  then  roto  STEP4 ; 
ck  ck  ”  1; 

\-l  to]  -  ^  -  \tck]; 

for  *-  1  step  1  until  m-1  do  A^  *-  x  •-  f^k-l(x); 

STEP4:  HALT(y). 

The  program  follows  the  algorithm  of  Section  3*2  very  closely. 

In  the  initialization  phase  (step  1)  line  (1)  computes  the  value  of  n. 

Line  (?)  assigns  to  the  counter  m  the  largest  value  such  that  m  <  n. 

Note  that  this  can  be  done  just  with  the  operations  of  +1,  -1  and  test 

for  zero  in  linear  time.  Line  (3)  computes  the  relevent  powers  of  m 

(these  can  be  computed  simultaneously  while  m  is  being  computed).  The 

counters  c-,...,c,  denote  the  number  of  values  saved  at  each  level.  There 
0  k 

is  some  overlap  in  values  saved  which  could  be  avoided.  As  shown  the 
initialization  phase  involves  two  passes  over  the  range  of  data  values  a 
through  fn(a).  This  can  be  done  in  a  single  pass  for  k=l  since  all 
increments  are  constant  (one)  independent  of  the  value  of  n. 


6 .2  The  Constant -space  Algorithm 


f 


> 


f 


r 


t 


t 


START 


counter  i»nijn,CQ, .  .  .  ,Cj^,d  ,  . . .  ,d^j 
da^L  y,xQ>. . .  .x^; 

STEP  1 :  if  p(a)  then  HALT(hfa) ) ; 
n  -  0;  x  <-  a; 

while  ~ip(a)  do  begin  x  *-  f  (x)  ;  n  *-  n  +  1  end ; 


y 

m 


h(x) ; 

nVk. 


d0  *”  1;  dl  *“  m; 
£  -  n;  x  -  a; 


2 

m 


Xk  -  x; 

while  £  >  d^  do 

begin  i  -  l  -  d^;  x 


k 
m  ; 


fdk(x);  c. 


c,  +  1  end ; 
k  - 


t 


» 


while  £  >  d^  do 

begin  l  -  £  -  d^,;  x  —  fd0(x);  ►  cQ  +  1  end ; 

STEP2:  y  -  g(fC0(x0),y); 
if  c.  =  0  then  goto  STEP^; 

co  -  co  -  1; 

goto  STEP2 ; 

STEP3  : 

STEP3-1:  ii_c^  =  ^  then  goto  STEP3»2> 
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'0 


m  -  1 ; 


goto  STEP2  ; 


STEP3 .k:  if.  c  =  0  then  goto  STEP4  ; 


go co  STEP3 . k— 1 ; 
STEP4:  HALT(y). 


The  initialization  phase  is  shown  here  for  the  case  where  explicit 
counters  are  allowed.  It  closely  parallels  the  initialization  phase  in  the 
linear-time  program  (Section  6.1).  The  rest  of  the  program  can  be  implemented 
using  only  the  counter  operations  -1  and  test  for  zero  which  means  it  can  be 
directly  implemented  without  any  explicit  counter  (see  Section  3-1). 

The  initialization  phase  can  be  implemented  without  any  explicit 
counters  as  follows.  Variables  m ,c  . . . ,c  ,d  , . . .  ,d  are  used  to 

U  K  vj  k 

represent  the  corresponding  counters  in  the  rest  of  the  program.  In  addition, 
variables  m7 ,x7 ,c '  , . . . ,c are  used  as  temporaries.  We  make  use  of  the 
following  nonrecursive  procedures  for  convenience  in  defining  the  operation 
of  the  program. 

procedure  invert  (c7); 
begin  local  c,d; 
c  *-  a ;  d  c  7 ; 

while  — ip fd )  do  begin  c  «-  f(c);  d  f(d);  end ; 
return (c)  ; 
end ; 
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m '  *-  a; 

y  -  h(invert(m')) ; 


U 


TRY:  m'  f(m');  m  *-  invert  (in' ); 

l  -  a> 

c,  *-  m ; 

TRY  :  if  p(c.^)  then  goto  TRY: 

C1  *"  f^Cl')  ’ 
cp  -  m; 

TRY,-,  :  ij_  p (c  )  then  goto  TRY,  ; 

co  -  f(c2); 

c,  -  m; 


& 


TRY^:  i_f  >?(ck)  then  goto  TRY^  ^ ; 

ck  -  f<ck); 

if.p(i)  then  goto  FOUND; 

t  -  f(l); 
goto  TRYk ; 

FOUND:  m  *-  f(m)  ;  comment :  m  has  now  been  found; 

-  invert (f(a)) ; 
d ^  -  m; 

d2  ~  multiply (d1,d1)  ; 

d^  -  multiply (d^ ,dk_p ;  comment :  d  , . . . ,d^  have  been  determined; 
x  •-  a; 


1+1 


1 


x  •-  x; 
k 

while  —ip(right(x,d^))  do 

begin  x  right(x,dk);  c'k  •-  f(c'k)  end; 


while  -ip(right(x,d0))  do 

begin  x  -  right (x,d0) ;  c'  -  ffc'^end; 

*•  invert  (c '  ) ;  ...  ;  «-  invert  (c'k); 

comment :  this  completes  the  initialization; 
This  complete*  ibe  description  of  the  constant -space  algorithm. 
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